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ABSTRACT 

We measure the evolution in the intrinsic shape distribution of early-type galaxies from z ~ 1 
to z ~ by analyzing their projected axis-ratio distributions. We extract a low-redshift sample 
(0.04 < z < 0.08) of early-type galaxies with very low star-formation rates from the SDSS, based 
on a color-color selection scheme and verified through the absence of emission lines in the spectra. 
The inferred intrinsic shape distribution of these early-type galaxies is strongly mass dependent: the 
typical short-to-long intrinsic axis-ratio of high-mass early-type galaxies (> 10 11 Mq) is 2:3, where 
as at masses below 10 11 Mq this ratio narrows to 1:3, or more flattened galaxies. In an entirely 
analogous manner we select a high-redshift sample (0.6 < z < 0.8) from two deep-field surveys with 
multi-wavelength and HST/ACS imaging: GEMS and COSMOS. We find a seemingly universal mass 
of ~ 10 11 Mq for highly flatted early-type systems at all redshifts. This implies that the process 
that grows an early-type galaxy above this ceiling mass, irrespective of cosmic epoch, involves forming 
round systems. Using both parametric and non-parametric tests, we find no evolution in the projected 
axis-ratio distribution for galaxies with masses > 3 x 10 10 Mq with redshift. At the same time, our 
samples imply an increase of 2-3 x in co-moving number density for early- type galaxies at masses 
> 3 x 10 10 Mq, in agreement with previous studies. Given the direct connection between the axis- 
ratio distribution and the underlying bulge-to-disk ratio distribution, our findings imply that the 
number density evolution of early-type galaxies is not exclusively driven by the emergence of either 
bulge- or disk-dominated galaxies, but rather by a balanced mix that depends only on the stellar mass 
of the galaxy. The challenge for galaxy formation models is to reproduce this overall non-evolving 
ratio of flattened to round early-type galaxies in the context of a continually growing population. 



1. INTRODUCTION 

At low redshifts, by number the dominant early- 
type galaxy is a disky system , commonly classified 
as SO galaxies (jDresslerl Il980t iMarinoni et al.l Il999f) 
or di sky elliptical galaxies (see iKormendy fc Djorgovskil 
1989, for a summary). These galax ies are smooth, 
but have significant rotation al support (|Krainovic et al.l 
120081: lEmsellem etail 120111 ). In addition, these galax- 
ies have b ulges, which can often contain 50% of the 
light (e.g. lLau rikaincn et all l2010f) . but the presence 
of disk sets these galaxies apart from the more mas- 
sive elliptical galaxies. In contrast, the most mas- 
sive galaxies are generally much rounder systems that 



are t r iaxial (e.g., Franx et al. 1 1 9 9 It f jorgens en fc Franxl 



1995 iVincent Rvdedl20ul lvaT"der Wel et al.l 120091; 
Bernardi et al.ll2010HEmsellem et al.ll201lD . The forma- 
tion process of the early-type population must not only 
stop star-formation, but must form galaxies with a vari- 
ety of appa rent shapes. 

Recently. Ivan der Wel et~aT1 (|2009l . vdW09) found that 
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there is a threshold mass for the formation of early-type 
disk g alaxies, a result which ha s been indicated in earlier 
work (I Jorgensen fc Franx 1994T). For galaxies with stellar 
masses below ~ 10 11 Mq, there is a broad distribution 
of projected axis-ratios. This implies that intrinsically 
flat systems, such disks or flattened ellipticals, populate 
these masses. Above th at mass, however, ga laxies be- 
come distinctly rounder. iBernardi et al.l (2010) find that 
this threshold mass is apparent in not just the projected 
axis-ratio, but in properties of the stellar population such 
as the color and color-gradients. The implication is that 
the most massive passively evolving galaxies are the re- 
sult of a different formation process than the galaxies 
below this mass threshold. If one assumes that round 
galaxies are a result of mergers, than the apparent ceil- 
ing in the mass distribution of disk galaxies reflects the 
limit in mass for disky systems. The process of galaxy 
formation, therefore, sets a mass scale above which mul- 
tiple mergers are the apparently only method of mass 
assembly. 

There is significant evidence for evolution of number 
densi ty of passively evolving L* galaxies in field sur- 
veys (I Wolf et alJl2003l:[Bell et al.ll200llBorch et al.ll2006! 
[ Brown et al.l 120071: IFaber et al.l 120071: iCirasuolo et al 
l2007Hllbert et al.ll2010fc iBrammer et al.ll2011| ). This evo- 
lution can occur via two paths, namely the process of 
merging building up red-sequence or of star-forming spi- 
ral galaxies being transformed into red-seq uence galax- 
ies w ith the cessation of star- formation (see IFaber et al.l 
120071 for a more thorough discussion) . By examining the 
evolution of the threshold mass as found by vdW09, we 
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can trace out the relative importance of these different 
formation channels over time. We have investigated this 
question by compiling mass-limited samples of passively 
evolving galaxies from 2: = to z = 1 in a variety of 
environments. We will use the projected axis-ratio mea- 
surem ents, which have been rob ustly tested in previous 
work (jHolden et all [20071 l200l hereafter H09), to de- 
termine the evolution in the distribution of disky and 
apparently round galaxies with redshift. 

Throughout this paper, we assume ft m = 0.27, J7a — 
0.73 and H a = 71 km s _1 Mpc -1 . All stellar mass es- 
timat es are done usin g a Chabrier initial mass function 
QMF. [Oiabrien 120031) . 

2. DATA 

2.1. Parent Sample Selection 

We construct stellar-mass limited samples of early-type 
or quiescent field galaxies with HST imag ing at red- 
shifts z ~ 0.7 from GEMS (|Rix et al.ll200l . COSMOS 
(jScoville et al .1120071 ) , and complement this wi th a sample 
of low -redshift counterparts from the SDSS ([York et al.1 
[2000f K Before we estimate stellar masses and select quies- 
cent galaxies, we construct paren t samples from existing 
catalogs. From the SDSS DR7 (|Abazaiian et all 12003) 
we select all galaxie^B in the redshift range 0.04 < z < 
0.08. 

For GEMS, which overlaps largely with the extended 
Chandra Deep Field-South (E-CDFS), we take the pub- 
lic catalog with photometry a nd derived quantit i es suc h 
as photometric redshifts from iCardamone et all (|2010t ) . 
As a pre-selection we require that galaxies are in the pho- 
tometric redshift range 0.6 < z < 0.8, have 2a K-band 
detections, which removes those objects in noisy parts of 
the image, lie within the B, V, and R images, and have 
good photometric redshifts. (x 2 < 250, which removes 
objects with deviating spectral energy distributions such 
asAGN). 

For COS MOS, we use the p ublic photometric redshift 
catalog by lllbert etlrtl (|2009D . Again, as a pre-selection 
we require galaxies to lie within the photometric redshift 
range 0.6 < z < 0.8, have errors on the r, i, z, J, and 
K magnitudes less than 0.3 mag, and errors on the g 
magnitude less than 0.5 mag. 

2.2. Rest-frame Magnitudes 

For each galaxy, we need to convert the observed pho- 
tometry into the equivalent photometry as if the galaxy 
were observed at a redshift of z = 0. We will refer to 
these magnitudes and colors with subscript 0, such as 
(u — r)o- To compute these values, we build on the 
approach we have u sed in the past, s ee fo r example 
iBlakeslee et~aT1 (|2006l ) , or iHolden et all (l20lol). T o com- 
pute the conversion, we use IBruzual fc Chariot! (|200l 
stellar populations with a variety of r models and a 
range of metallicities. At each redshift of interest, we 
compute the relation between a pair of observed magni- 
tudes and a rest-frame magnitude of the following form 
mo = m obsi + A(m obsi - m obS2 ) + B where m a is the 
magnitude of interest at z = and m obsi is the observed 
magnitude that is closest to covering the same portion 

We use the 'Galaxy' table of the DR7 data release accessible 
through CAS jobs. 



of the galaxy spectral energy distribution at a given red- 
shift z. We then fit the distribution of coefficients, A 
and B, as splines as a function of redshift z and ob- 
served color m b Sl — rn }, S2 . By fitting a spline to the 
coefficients, we can calculate a unique conversion for each 
galaxy based on its observed redshift and colors. Then, 
we add 2.51og 10 (l + z) to each magnitude. 

2.3. Color-Color Selection of Quiescent Galaxies 

To select galaxies that have very little or no star- 
formation activity, we follow the now commonly adopted 
approach to define a region in color-color space that ef- 
fect ively separates such galaxies from star-forming galax- 
ies dWuvts et al.|[2007t iWilliams et all [20091: IWolf et al.l 
l2009UWhitaker et al.ll2010fc IPatel et al.ll2010D . In Figure 
[I] we show the rest-frame (u — r)o and (r — z)o color dis- 
tribution of the full SDSS spectroscopic galaxy sample at 
0.04 < z < 0.08, computed as described above from the 
SDSS model magnitudes. Quiescent galaxies populate a 
small region of color-color space as indicated by the pro- 
nounced dark area in the figure. Star-forming galaxies 
populate a much more ex tended, yet fairly narrow se- 
quence. IPatel et al.l (|2011l ) has shown that the galaxies 
in the quiescent region are morphologically like early- 
type or elliptical and SO galaxies in the redshift range of 
0.6 < z < 0.8. Therefore, for the rest of this paper, we 
will refer to galaxies in that region as early-type systems. 

2.3.1. Determining the Color-color Selection Criteria 

In order to define the optimal color-color selection cri- 
teria and establish the reliability of using these crite- 
ria to select quiescent galaxies, we compare the color- 
color distribution with the Ha detection rate. In Fig- 
ure Q] we show the color-color distribution of galaxies 
that are detected in Ha (at the 2.5ct level: EW(Ha) > 
2.5aEw(Ha)) with blue contours and those that are below 
that detection threshold in red. 

We define a polygon to photometrically select quiescent 
galaxies as shown in the figure, with the location of the 
three segments - but not the slope of the tilted segment, 
which is chosen to run parallel to the star-forming se- 
quence - as free parameters. We then calculate the frac- 
tion of Ha emitters contained within the polygon (the 
contaminating fraction f c ) and the fraction of Ha- less 
galaxies outside the polygon (the missing fraction f m ) for 
a each set of polygon parameters. The optimal parame- 
ter values found by minimizing f c + f m + abs(f c — fm), 
which ensures that f c and f m are essentially equal while 
their sum is minimized. 

For the full spectroscopic sample, the optimal color- 
color selection criteria are described by the polygon 
(it - r) > 2.26, (r - z) < 0.75, (u - r) > 0.76 + 
2.5(r — z)q, with contaminating and missing fractions of 
fc ~ fm ~ 0.18. The color-color distribution peaks at 
(u — r)o = 2.55 and (r — z)o = 0.67, which is computed 
by finding the maximum in density of the color-color 
distribution after integrating over a circle with radius 
0.03 mag, the approximate relative error in the colors. 
If we pre-select galaxies with a certain minimum stellar 
mass (see below), the polygon does not change by more 
than 0.02 mag up until the minimum mass reaches very 
large values (~ 10 11 Af Q ). The contaminating and miss- 
ing fractions have a mild dependence, and increase from 
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~ 0.18 to ~ 0.20 if the minimum mass increases from 
2 x 10 10 M Q to 10 11 M Q . 

A simple red-sequence selection has a much higher con- 
tamination rate. If we use only the u — r selection, our 
sample has a higher success rate, we miss only 0.03 of 
the early-type sample, but at a cost of a contamination 
fraction of 0.37. Such a high fraction of star-forming 
galax ies in red-sequence selections has been seen before 



galaxies m red-sequence 
(e.g.. lMaller et al.ll2009ft 



2.3.2. The Color-color selection for 0.6 < z < 0.8 Sample 

We use the above results for the SDSS sample to de- 
fine the appropriate color-color selection criteria for the 
GEMS and COSMOS surveys. For consistency with the 
SDSS, we derive rest-frame (u — r)o and (r — z)q color 
(as well as stellar masses, see below) using the public 
photometric redshift estimates for both the GEMS and 
COSMOS surveys. The color-color distributions for the 
GEMS and COSMOS samples are shown in Figure 
where a similar peaked distribution in color-color space 
indicates the presence of a quiescent population of galax- 
ies. Of course, we do not have the luxury to compare 
color-color selection criteria with Ha line strengths at 
z ~ 0.7. Instead we shift the polygon defined above in 
both (u — r)o and (r — z)q by the difference in the location 
of the density peaks in the color-color distribution be- 
tween high-z samples and the SDSS sample. For GEMS, 
the color-color distribution peaks at (u — r)o — 2.24 and 
(r - z) = 0.61, and for COSMOS at (u - r) = 2.28 
and (r — z)o = 0.70. As with the SDSS, we determine 
these centroids by integrating over a circle with radius 
0.03 mag. These numbers do not change by more than 
0.01-0.02 mag in case errors in the color of 0.05 ma are 
assumed. 

There is an unfortunate difference in the locations of 
the quiescent galaxies in color-color space between the 
GEMS and COSMOS samples. This difference should be 
attributed to systematic effects in the photometry, which 
would be of the level of ~ 10%. However, our method 
to define the color-color selection criteria for quiescent 
galaxies ensures that these systematic effects do not af- 
fect our sample selection and analysis. 

2.4. Stellar Mass Estimates 



The DR7 catalog of lBrinchmann et al.l (|2004l ) provides 
stellar mass estimates for the galaxies in our SDSS sam- 
ple. These estimates use a new methodology for DR7, 
namely the spectral energy distributions are fit to the 
broad band photom etry similar to the implementation of 
iSalim et al.l (|2007j ) instead of using only the spectra as 
was d one in the past. T hese diff erences are d ocumented 
at http://www.mpa-garching.mpg.de/SDSS/DR7/, The 
model fit to the photometric data yields a distribution. 
We select the median mass value as the best estimate 
with the 16 and 84 percentile confidence limits as esti- 
mates of the errors. 

In Figure [3] can be seen that quiescent galaxies obey 
a tight relation between color and stellar mass-to-light 
ratio - this is obviously inherent to the method. We de- 
rive a linear relation by computing the biweight mean 
in narrow color bins (0.02 mag wide) over the range 
0.64 < (g — r)o < 0.82, where the sequence is well pop- 
ulated. A fit that minimizes the scatter in the biweight 



yields: log 10 (M/L ff ) = -1.024+ 1.966(g-r) . The scat- 
ter decreases from a = 0.10 to a = 0.04 in log 10 (M/L g ) 
over the probed color range. 

For our high redshift samples, we would like to esti- 
mate stellar masses consistently. However, we cannot 
directly use the above relation because quiescent galax- 
ies at z ~ 0.7 have had different star formation histories 
than their present-day counterparts, because, if nothing 
else, the universe was younger. We use independent con- 
straints on the evolution of (g — r)o and M/L g to convert 
the above relation between color and M/L for z ~ 0.06 
galaxies to one appropriate for z ~ 0.7 galaxies. We 
take the evolution in M/L g fro m the recently derive d 
fundamental plane evolution by iHolden et al.l ([2010) : 
dlog 10 M/L g = (-0.60 ± 0.04)2. We infer the (g - r) 
color evolution by computing the biweight mean of the 
photometrically selected SDSS, GEMS, and COSMOS 
samples, for which find, respectively, {g—r)o = 0.75, 0.61, 
and 0.60. Thus, we shift the relation between (<? — r)o and 
M/L g shown in Figure 1 to the left by 0.14 (COSMOS) 
or 0.15 (GEMS) mag, and down by 0.38 dex, resulting in 
stellar mass estimates that are ~ 0.1 dex lower at z ~ 0.7 
than those for z ~ 0.06 at the same (g — r)o color. 

To estimate errors on the stellar masses for the high 
redshift samples, we use the scatter in the M/L g versus 
g — r relation at a the color of a given galaxy. This 
scatter is done in the relation after it is shifted by the 
mean g — r color and M/L evolution. We then add in 
quadrature the uncertainties in the zero point of the M/L 
evolution. Thus the error in the high redshift stellar 
masses range from 0.06 to 0.11 dex depending on the 
color of the galaxy. 

If use the M/L evolu tion from the field sample of 
van der Wei et all (pfjol . we would estimate smaller 
masses at z = 0.7 by 0.02 dex, a small systematic shift. 
This differen ce is in good agreement w ith the statistical 
errors of the Ivan der Wei et all (|2005| ) and IHolden et al.l 
(|2010f) samples. 

2.5. Projected Axis-Ratio Measurements 

Our projected axis-ratio measurements, q pro j, come 
from two approaches. For the GEMS and COSMOS sam- 
ples, these v alues are the result of fitting Sersic models 
(|Sersid H968D to the two-dimensio nal images using the 
software GALFIT ()Peng et alj|2002t) . For the SDSS cata- 
log, we use the estimates from fitting a de Vaucouleurs 
model as part of the SPSS DR7 photometric pipeline 
(jAbazajian et alJl2009h . 

The GEMS fits are contained in the catalog of 
lHaussler et all (|2007fh These are done using GALFIT as 
part of larger package known as GALAPAGOS^. GALAPAGOS 
has the advantage of automating the catalog construc- 
tion and model fitting process. The software automat- 
ically fits neighboring objects and incorporates its own 
sky subtraction algorithm in order to ensure robust pa- 
rameter me asurements. A more t horough discussion can 
b e found in Hiiusslcr et al. (|2007h . 

iGriffith fc Sternl (|2010l T used GALAPAGOS to produce 
publicly available catalogs for both COSMO S and 
GEMS. We will use the catalog oflGriffith &; Sternl (f2010h 
for COSMOS and the catalog of lHaussler et alJ (|2007f ) for 
GEMS, though, as we show below, there is no measurable 

6 http:/ /astro-staff.uibk.ac.at/~m.barden/galapagos/ 
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Fig. 1. — The u — r versus r — z relation for galaxies in the 0.04 < z < 0.08 from our SDSS sample in grey scale. Contours show the 
density of galaxies with < 2.5EW (Ha)/a(EW(Ha)) (red) and without < 2.5EW(Ha)/a(EW(Ha)) (blue). The contours are at 100, 
300, and 1000 galaxies per color bin. The black lines have been chosen to minimize the difference between the number of galaxies with 
< 2.5EW(Ha)/a(EW(Ha)) within the box and the number of galaxies without < 2.5EW(Ha)/a(EW(Ha)) outside the box. This 
ensures robust separation of star-forming, late- type galaxies and quiescent, early- type galaxies even when spectroscopic information is not 
available, for example at higher redshifts. 

galaxy magnitude. 

In addition to data-related differences, the fitting al- 
gorithms also differ between the low- and high-redshift 
galaxy samples. We use the adaption of GALAPAGOS for 
SDSS imaging from IGuo et al.l pOOl to measure the 
axis-ratios of a sub-sample of our SDSS galaxies in a 
manner that is fully consistent with the treatment of 
the high-redshift galaxies. For small axis-ratios, system- 
atic differences are expected to be largest. Therefore, 
we select 412 SDSS galaxies from our sample with axis- 
ratios 0.3 < q pro j < 0.305 as determined by the SDSS 
pipeline. These represent the extreme end of the distri- 
bution, where systematic differences in fitting procedures 
or the point spread function should be the most mani- 



difference between th e iGriffith fc Sternl (|2010l ) and the 
lHaussler et al.l (|2007t) catalogs. 

2.5.1. Consistency and Reliability of Projected Axis-Ratio 
Measurements 

Our analysis hinges on the consistent measurement of 
the axis-ratios of galaxies at very different redshifts and 
from very different imaging data sets. In iHolden et al.l 
(2009), we tested our our axis-ratio measurements from 
GALFIT with simulations of observations of high redshift 
galaxies using real low redshift galaxies as templates. We 
found these measurements to be robust, with a negligible 
shift in the axis-ratio from z = to z — 1 of Sq pro j ~ 
—0.01 with a scatter of o q ~ 0.01 — 0.03 depending on 
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Fig. 2. — The color-color relation for galaxies in the 0.6 < z < 0.8 for COSMOS (left) and GEMS (right). The black lines show our color 
selection based on the lines from Fig. [l] shifted by the median colors of the quiescent population. As with Figure [T] the data have been 
lightly smoothed with a 1.5 pixel FWHM Gaussian before we plot the grey scales. 
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Fig. 3. — The g — r-M/L g relation for early-type galaxies in SDSS 
(M > 1.25 X 10 10 M ; 0.04 < z < 0.08) selected by their u-r 
and r — z colors (see Figure [TJ- The solid line is the best-fitting 
relation. To estimate masses for our z ~ 0.7 masses from COSMOS 
and GEMS in a consistent manner we use the dashed line which 
has the same slope but a different zero point to compensate for 
evolution in the color and the mass-to-light ratio as explained in 
the text. 

fest. The axis-ratio measurements from GALAPAGOS are 
fully consistent with the SDSS pipeline measurements: 
Slproj = Qgalapagos ~ qsDSS = 0.01 ± 0.03, where 
0.03 is the root mean-squared scatter. 

GALAPAGOS includes many free parameters that affect 
source detection, sky subtraction and treatment of neigh- 
bors. However, regardless of the different set ups that 



IHaussler et all (l200l and IGriffith fc Sterol (f20Toh used 
for the GEMS data set, there is no systematic difference 
between the two catalogs for the objects in our sample 
(Sqproj := 0.02 ±0.03). 

A final test of the robustness of our measurements is a 
comparison between axis-ratio measurements from those 
galaxies in the GEMS that lie within the much d eeper 
ACS images from GOODS (IGiavalisco et al.ll200l . The 
difference is 5q pro j — —0.01 with a scatter of <j q = 0.05. 
The negligible systematic difference is encouraging. 

Summarizing, all tests and simulations stress that our 
axis-ratio measurements across the different data sets 
at different redshifts and performed with different algo- 
rithms, are internally fully consistent. 

2.6. Completeness 

We would like to measure evolution of the axis-ratio 
as a function of stellar mass. In order to probe as far 
down the mass function as possible, we need to limit our 
sample to above the mass limit where the initial survey 
photometry is complete. 

For the SDSS, we find that our sample will have no 
bias as a function of redshift for early-type galaxies with 
a mass of 3 x 10 10 Mq for the whole redshift range of 
0.04 < z < 0.08. This means that galaxies above that 
mass range have an equal probability of being included 
regardless of redshift. For the mass of 1.25 x 10 10 Mq, 
this is correct for only the redshift range of 0.04 < z < 
0.06. For the rest of this paper, we will include the whole 
sample to first mass limit, 3 x 10 10 M©. For the mass 
range 1.25 x 10 10 M Q < M < 3 x 10 10 M Q we limit the 
redshift range to 0.04 < z < 0.06. 

At higher redshifts, we have two different samples with 
two different selections. The GEMS sample was selected 
using a R limiting magnitude. In contrast, the cata- 
log from COSMOS we use was based on a combina- 
tion of / and 3.6 /jm imaging. Combining the com- 
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pleteness computations from lllbert et al.l (|2010f ) with the 
3.6 fim magnitude distribution of galaxies with masses 
10 10 < M/Mq < 1.5 x 10 10 , we infer that our sample is 
complete in this mass range. We adopt 1.25 x 10 10 Mq 
as our mass limit. From a similar estimate for the GEMS 
sample we infer a completeness limit of 3 x 10 10 M Q . For 
the rest of this paper, when we compare samples with 
masses > 3 x 10 10 Mq, we will be comparing the whole 
SDSS sample with the combined GEMS and COSMOS 
sample. For masses below that limit, we are only com- 
paring a subset of the SDSS sample with 0.04 < z < 0.06 
with the COSMOS sample. 

2.7. Axis-Ratio Dependence of the Observations 

In star-forming disk galaxies, dust is often concen- 
trated in the mid-plane. This causes a strong color- 
depen dence in the axis-ratio. For example, IMaller et al.l 
( 2009) finds a slope of ^-0.5 in the u — r color with q pro j. 

For our sample of early-type SDSS galaxies, we find 
that the slopes of the u — r axis-ratio relation are small, 
-0.07 to -0.09 magnitudes, depending on galaxy mass. 
Thus, rounder galaxies are mildly bluer than thinner 
galaxies. These shallow slopes for our early-type popu- 
lation has two implications. First, the completeness our 
sample is not drastically impacted by the apparent axis- 
ratio of the galaxy. Second, that our sample of early-type 
galaxies are optically thin. This second criteria implies 
that the selection in color-color space removes galaxies 
with significant amounts of gas and dust, which is typi- 
cal for passively-evolving systems. 

Though the slopes of the u — r color with axis-ratio are 
small in our SDSS DR7 sample, they still could be the re- 
sult of our color-color selection. Therefore, we measured 
the relation of the u — r color with axis-ratio for the 
sample of galaxies with < 2.5 E W (H a) / a (EW (Ha)) 
for comparison. We found statistically indistinguishable 
slopes in the u — r versus q pro j relation. 

3. INTRINSIC SHAPES OF 0.04 < Z < 0.08 EARLY-TYPE 
GALAXIES 

We plot the axis-ratio distribution as a function of stel- 
lar mass for the 33086 galaxies in our final SDSS DR7 
sample in Figure |4j In that figure, we also plot the 
axis-ratio distributions in three broad mass bins. There 
are two obvious features in Figure 01 At high stellar 
mass, there is a clear absence of elongated objects, imply- 
ing the absence of high-mass, disk-dominated, passively- 
evolving galaxies at both z = 0.6 — 0.8 and at z ~ 0, 
as was seen in vdW09. Second, below a threshold 
mass of ~ 10 11 M , we find a much broader distribu- 
tion of galaxy axis-ratios, imply ing a significant popula- 
tion of disk-dominated galaxies ([van der Wei et aLl[20i0l 
vdWIO). 

3.1. A Parametric Description of the Data 

We model the observed, projected axis-ratio distri- 
bution to infer the intrinsic axis-ratio distribution, as- 
suming two types of toy models: triaxial systems 
and oblate spheroids. We use triaxial models because 
they are well motivated by other res ults which include 
galaxy kinematics (see, fo r example iFranx et al.l 119911 : 
van den Bosch eFaL| [2008i We use the triaxial model of 
Franx et al.l ( 19911) which has two components, a triax- 

2 — h 2 

iality, T — a -i_ -i , and an ellipticity e = 1 — c/a where 



a, b, and c are the three different axis making up the 
triaxial system, with a being the largest and c being the 
smallest. We assume that both of these quantities are 
distribution as Gaussian and so fit T , e, along with ot 
and er £ . 

Our second model is a subset of the triaxial model, 
namely the oblate spheroid model is defined by fewer pa - 
rameters. We use the formalism of lSandage et al.l (1970) 
to describe the oblate spheroid component of the distri- 
bution. The free values are the minimum apparent axis 
ratio, b, corresponding to thickness of the spheroid. Once 
again, we assume a Gaussian distribution around b of size 
fib. The oblate spheroid model can be reproduced by a 
triaxial model with T = and o~t — 0. 

3.1.1. The Fitting Process 

We fit a number of models to the data simultaneously, 
ranging from one to three. If we fit more than one, we 
add additional fractions, / which is simply the fraction of 
overall model distribution that is represented by a given 
component. To determine the best fitting model, we 
maximize the log likelihood assuming a Poisson distri- 
bution. Because the triaxial models are computation- 
ally intensive, we pre-compute the apparent axis-ratio 
distribution for a grid of values of T and e. To com- 
pare our data with models, we must then bin the data 
to same binning as our precomputed triaxial models, or 
bins of Sq pro j = 0.01. For a given set of model param- 
eters M = M(Ti, a?, 1, c e! i,T2, ....), we compute the 
relative probability for each bin in q pro j- We adjust the 
normalization such that the model has the same number 
of galaxies as the input sample, and can now compute the 
Poisson log likelihood log Li = n.i log(mi) — m ; — log(ni!) 
for each bin i where ni is the number of galaxies in that 
bin and rrii is the number predicted by the model with 
parameters M. We use a Monte Carlo Markov Chain to 
fit the distribution of axis-ratio values. 

3.1.2. Fitting the z ~ 0.06 Galaxy Sample 

We begin the fitting process with a single model, 
namely a triaxial distribution. In general, when fitting 
triaxial models to only the axis-ratio distribution, the 
triaxiality is not well constrained. This is a simple con- 
sequence of the fact the data are a single, projected value 
while our model contains two different axes. For galaxies 
with M < 10 11 Mq, we find that a single component is 
not an adequate description of the data, with the model 
producing a deficit of galaxies at larger q pro j values and 
an overabundance of galaxies at small q pr oj's- Above 
that mass threshold, a single triaxial model appears to 
match the data well, however. 

The addition of a second, oblate spheroid model 
matches the data much more closely. We plot these 
model fits, along with the SDSS data in blue, in Fig- 
ure |U Comparing the log likelihood values of the single 
model and two component model shows that the two 
component model is a better description of the data at a 
statistically significant level (> 3a). 

Finally, we add a third triaxial component. In general, 
whether we fit three independent triaxial components or 
an oblate disk with two independent triaxial components, 
the fitting process de-weights the third component. The 
resulting model is dominated by only two components, 
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Fig. 4. — The distribution of axis-ratios as a function of stellar mass for quiescent galaxies in the SDSS (top). When drawing the grey 
scale we normalize by the number of data points in each mass bin. Individual points represent galaxies where the number per bin would be 
less than three. The sample is complete to 3 X 10 10 M for 0.04 < z < 0.08, and to 1.25 X IP 1 " M P> for . 04 < z < 0.06. The narrowing of 
the distribution shows the threshold for early-type, disky galaxies, as seen by vdW09 and Bcrnardi ct al. (2010). In the bottom panels, we 
plot the differential distribution of axis-ratios in three mass bins (marked by the vertical lines in the top panel with the log of the range in 
solar mass given). The num ber o f early-type galaxies in each bin is given below the mass range. We over plot, in black, parametric models 
of the data described in Sec. 13. ll 



which make up >90% of the model. For the rest of the 
paper, we will use the two component model with one 
constrained to be an oblate spheroid for simplicity. 

3.2. Model Results for z ~ 0.06 

For the highest mass bin, we find a low oblate spheroid 
fraction, 13 ± 4 %. The best fitting model has T ~ 0.4 
near a triaxial model in the formalism of iFranx et al.l 
(1991). This is close to the results found in that paper, 
and what we expected for large, dispersion supported 
systems. 

Below 10 11 Mq, we can see the long, flat distribution 
to small axis-ratios well described by an oblate spheroid. 



In the middle mass bin, the fraction of oblate spheroids 
is 54 ± 6% and it grows to 70 ± 8% in the lowest mass 
bin. Interestingly, for the middle mass bin, the triaxial 
component has a very similar shape distribution as the 
triaxial component at higher masses. 

One of the more robust parameters for even the triaxial 
models are the minor to major axis-ratios. In Figure [5j 
we plot the inferred distribution from the Monte Carlo 
Markov Chain of c, or the minor to major axis-ratios. 
The two different components to our model are readily 
apparent, along with their relative weight. 

Most striking, in even the high mass bin, there are no 
really round galaxies. The modal value for c is inferred 



to be c ~ 0.65 or a ratio of 1.5 to 1. Such an apparently 
small minor axis-ratio is actually in good agreement with 
the data. The intermediate axis, assuming a T = 0.4 or 
so, will be b = 0.9 and so such systems will often appear 
to close to but not perfectly round, exactly as we see in 
Figure HI 

The second result is that, in all mass bins, there are 
no very thin galaxies. Among the star-forming galaxy 
population, disks can can thin c ~ 0.2 , especially at 
lower masses and even in red pass- bands (|Rvderj|200a 
IPadilla fe" Strauss 2008) . Our modal value for the lowest 
mass bin in Figure [5] is 0.25 and the median value for the 
whole of the distribution is 0.29. Thus, the low-mass, 
passively evolving population is ^50% thicker than sim- 
ilar mass active star-forming galaxies which have values 
more like 0.2. 

4. SHAPE EVOLUTION OF EARLY- TYPE GALAXIES 

The final 0.6 < z < 0.8 samples contain 1332 galaxies 
in total (171 from GEMS; 1161 from COSMOS) with 
masses greater than 1.25 x 10 10 Mq, though see Sec. 12.61 
for details on the completeness with mass. We will now 
use these samples to measure the evolution of the axis- 
ratio distributions for samples in a fixed mass range. 

4.1. The Shape Distribution of 0.6 < z < 0.8 Galaxies 

The combined sample of GEMS and COSMOS, along 
with the corresponding SDSS sample, can be seen in Fig- 
ure [SJ We plot the axis-ratio distribution as a function 
of stellar mass for the 0.04 < z < 0.08 and 0.6 < z < 0.8 
samples. In Figure [7j we plot axis-ratio distribution 
in three broad mass bins, showing both the differen- 
tial and cumulative distributions. There are two obvi- 
ous features in Figure [SI At high stellar mass, there is a 
clear absence of elongated objects, implying the absence 
of high-mass, disk-dominated, passively-evolving galax- 
ies at both z ~ 0.7 and at z ~ 0, as was seen in vdW09. 
Second, below a threshold mass of ~ 10 11 M Q , we find 
a much broader distribution of galaxy axis-ratios, imply- 
ing a significant popula tion of disk-dominated galaxies 
(jvan der Wei et al.ll201Cl vdWIO). 

We use the parametric models we discuss in Sec. l3.ll to 
fit the distribution of data in Fig. [71 Because of the much 
smaller sample size, we fit the data with both the same 
models as we did the SDSS but also freeze some subsets 
of the parameters. In every case, we find that, within 
the limits of the uncertainties from the fits, the data can 
be described by the same model at both redshifts. 

The clear similarity between the axis-ratio distribu- 
tions of our low- and high-redshift samples shows that a 
ceiling mass for quiescent, disk-dominated galaxies exists 
at least since z ~ 1, generalizing the low-redshift result 
from vdW09. It is clear that, above 10 11 M , we find 
few flat galaxies. In contrast, these galaxies make up a 
much larger proportion of the population at masses be- 
low 10 11 Mq. Therefore, at z ~ 0.7, there is the same 
threshold for the population of "disky" early-type galax- 
ies, that is found at z ~ 0. 

4.2. The Axis-Ratio Dependence of the Mass Function 

of Early-type Galaxies 

Previous work has found a significant amount of 
evolution in the mass function of early-type galaxies 



(IWolf et alJ l200a iBelTetall l200l IBoreh et al.1 [20061: 
Brown et alj 120071: iFaber et all 120071: iCirasuolo et al.1 
20071 lllbert et alJ I2010t iBrammer et all 120111) " Gen- 
erally, the evolution appears at the lower mass end. 
As we have found that the axis-ratio distribution also 
changes with mass, with more disk-like early-types at 
lower masses, we will investigate if the mass function 
evolution is different for different sub-populations as se- 
lected by axis-ratio. 

We fit the mass distribution using a Schechter func- 
tion. We use a standard maximum likelihood approach 
assuming a Poisson likelihood model and perform the fits 
over the mass range where our sample is volume com- 
plete. Each galaxy has its own error estimate for the 
mass measurement, so we convolve the Schechter func- 
tion individually to compute the likelihood distribution. 
Including the errors in the fitting process has the ad- 
vantage of not causing M* being forced to higher values 
because of the occasional statistical fluctuation in a mass 
measurement. 

4.2.1. Edge on systems with q pr oj < 0.4 

We select all galaxies with q pro j < 0.4 and masses 
> 3 x 10 10 Mq. This selects disk-dominated systems 
that are viewed close to edge-on but above the mass 
limit where we are complete for the whole volume of both 
samples. We fit the mass distribution with a Schechter 
function with a fixed value of a = -0.7 (jBell et al.H2003f) 
for both our 0.6 < z < 0.8 and SDSS sample. We find 
the value of log 10 M*/M Q = 10.68 ± 0.10 (errors come 
from bootstrapping the data) for quiescent galaxies with 
qproj < 0.4. This value lies within la of our 0.04 < 
z < 0.08 field sample of log 10 M*/M e = 10.58 ± 0.01. 
We confirm the lack of strong evolution by using Monte- 
Carlo simulations where we adjust the mass distribu- 
tion of the z ~ 0.06 sample and create sub-samples 
of the same size as our high-redshift sample with the 
same mass limits. From this we find that the typical 
mass of q pro j < 0.4 galaxies above our mass complete- 
ness limit can only shift by by ±0.06 dex, ~16%, in our 
0.6 < z < 0.8 sample. We also confirm this result using 
non-parametric tests, the Kolmogorov-Smirnov test and 
the Mann- Whitney test, which also show no significant 
difference in the two samples. 

4.2.2. Round systems with q vro j > 0.6 

We also look for evolution in the apparently round 
galaxy population, those with q pro j > 0.6. From our 
modeling results in Sec. 13.11 we expect that this popu- 
lation is a combination of those mostly triaxial systems, 
with intrinsic ratios of 2:3, and the more flattened, or 1:3, 
population that dominates at lower masses. Evolution in 
this population, if not mirrored in the q pro j < 0.4 popu- 
lation, would imply evolution in the more triaxial com- 
ponent of the population that dominates at high masses. 

When we examine the galaxies that are round, q pro j > 
0.6, we find significant (> 3<r) though mild amount evolu- 
tion in the mass function. For our SDSS sample, we find 
log 10 M*/M Q = 10.93 ± 0.01 while in our 0.6 < z < 0.8 
sample we find log 10 M* /Mq = 10.85 ± 0.02. As before, 
we confirm this result at the > 3cr with the Kolmogorov- 
Smirnov test and the Mann- Whitney test. We also con- 
firm this result by drawing sub-samples of galaxies from 
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Fig. 5. — The intrinsic axis-ratio distribution (dn/dc), where c denotes the smallest-to-largest axis-ratio, in three mass bins for early-type 
galaxies in the SDSS. These plots represent our inferred results weighted by the likelihood distribution combined with our Gaussian model 
parameters. The mo del u sed to infer these distributions consist of an oblate component (where c = b) and a triaxial component (where 
c = 1 — e) — see Sec. 13.11 for a detailed description of this model and how the results are represented here. While details depend on the 
modeling assumptions, high-mass early-type galaxies typically have intrinsic short-to-long axis-ratios of 2:3, while M* early-type galaxies 
typically have 1:3. 
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Fig. 6. — The distribution of axis-ratios as a function of stellar mass for quiescent galaxies for the SDSS (left) and our combined COSMOS 
and GEMS 0.6 < z < 0.8 sample (right). The SDSS data are the same as shown in Figure [4] In each, we over-plot the 10th percentile, 
median and 90th percentile axis-ratios as a function of mass for both the SDSS (blue) and the 0.6 < z < 0.8 sample (red). At masses of 
> 3 X 10 10 M@, the patterns appear indistinguishable in the two samples, with a steady tapering above 2 X 10 11 Mq yielding an effective 
ceiling mass fo r high elongated system. This implies that the threshold for early-type, disky galaxies seen at low redshift by vdW09 and 
IBernardi et alj 1120101) does not significantly evolve out z ~ 0.7. Correspondingly, below that ceiling mass, both samples show similar 
distributions. This implies that at z ~ 0.7 the M* early-type galaxy is moderately "disky", as we show in Figure [5] In contrast, the 
mass density of early-type galaxies grows between z ~ 0.7 and z = 0. Our result shows that the mass growth must roughly preserve the 
distribution of axis-ratios in each mass bin. 
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Fig. 7. — The axis-ratio distribution for our early-type galaxy samples in three mass bins, with top and bottom showing the normalized 
cumulative and differential distributions respectively. The SDSS data are plotted with a blue line, in both, and grey shading in the bottom 
panels, while, we show our combined sample of early-type galaxies at 0.6 < z < 0.8 in red. The total number of galaxies in the 0.6 < z < 0.8 
sample is given in the bottom panels, with the size of the SDSS samples for the same mass range given in Figure [4] We plot the median 
axis-ratio for the SDSS with a blue vertical line and the 0.6 < z < 0.8 sample as a red line. The axis-ratio distribution is statistically similar 
in all three mass bins, with only the lowest mass bin showing a hint of evolution. Because statistically similar models as in Fig. [5] describe 
the 0.6 < z0.8 sample, out to z~l, massive early-types have axis-ratios of 2:3 while early-types around M* show a thinner distribution 
closer to 1:3. 
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the q pro j > 0.6 and 0.04 < z < 0.08 SDSS sample of the 
same size as the 0.6 < z < 0.8 sample. We find that these 
sub-samples recover log 10 M* /Mq = 10.93 with a scatter 
of ±0.02 dex. The larger question is, does this evolution 
represent a change in the galaxy population or is it a 
result of our measurements? Our 0.6 < z < 0.8 stel- 
lar masses have a systematic uncertainty of ±0.04 dex. 
2Thus, the shift in the mass function we find is inter- 
esting but not statistically significant. Because this an 
error on the zero point of stellar masses, which we derive 
from evolution in the fundamental plane (see Sec. 12 .4[) . 
the systematic error on the stellar masses can be lowered 
in future work. This will confirm or refute this apparent 
evolution shape of the mass function of round early-type 
galaxies. 

4.2.3. The whole of the population 

As check on our mass functions, we fit for M* with a 
fixed a = —0.7 for whole of our 0.04 < z < 0.08 sample 
of passively evolving galaxies and find log 10 M* /tf w = 
10.87 ± 0.01 M Q , in good agreement with Be ll et al.l 
(2003) after accounting for differences in the IMF and 
h. We find log 10 M*/M Q = 10.85 ± 0.0 2 f or our high 
redshift sample, similar to iBorch et al.l ((2006). In Figure 
[U we show the mass functions for three selections in axis- 
ratio (qproj < 0.4, q pro j > 0.6 and all galaxies regardless 
of q P roj)- We also plot our estimate of the total num- 
ber density of galaxies per logarithmic density bin. It is 
clear that we recover the trend in the densi ty evolution 
of the passive galaxy population f o und b y lllbert et"aTI 
(|2010l ). We note that lllbert et al.l (|2010h found evolu- 
tion in a. We find that, because of our high mass limit 
of 3 x 10 10 Mq, we have little statistical constraint on 
the best fitting value of a. To improve our results would 
require implementing completeness corrections for both 
samples. Nonetheless, we reproduce M* with the same 
sample. Thus, despite our different methodology for de- 
termining stellar masses and different sample definitions, 
we find consistent results with other measurements of the 
mass function. 

4.3. The Bulge-to-Disk Ratio of the Population of 
Early-type Galaxies 

The average axis-ratio of a galaxy population is di- 
rectly deter mined by the populations average bulge-to- 
disk ratio ()Binnev fc M errifield 119981 ). assuming that 
bulges are drawn from a different axis-rati o distribution 
as co mpared with disks (see Figure 2 of iDutton et al.1 
1201 ll which shows that this is true for all but the high- 
est mass galaxies). Therefore, by examining the evo- 
lution of the axis-ratio of the population, we are de- 
termining whether or not the population becomes more 
bulge-dominated or disk-dominated as a function of time, 
though we cannot determine if this evolution happens for 
individual galaxies or because of a changing mix of bulge- 
to-disk ratios in the population. 

In the range 3 x 10 10 Mq < M < 10 11 M , we find 
no difference in axis-ratio distribution between the two 
redshift slices. In the lowest-mass bin (1.25 x 10 10 Mq < 
M < 3 x 10 10 Mq) we see a small but barely signifi- 
cant difference between the two samples, suggestive of a 
more disk-dominated population in the 0.04 < z < 0.08 
sample. Because of the low significance of the difference 
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Fig. 8. — Mass functions of the 0.04 < z < 0.08 (red filled circles) 
and 0.6 < z < 0.8 (blue open squares) samples, for three different 
axis-ratio (q pro j )selections. We over plot with solid lines the best 
fitting Schechter functions, all assuming a fixed a = —0.7. Our 
fitting process includes the errors on the stellar mass estimates. 
Reinforcing the visual impression from Figure [6] the mass function 
of flat galaxies has a much smaller value of M* than the mass 
function of almost round galaxies. There is also little evolution in 
the shape of the mass function except for possibly the q pro j > 0.6 
sample, see text for further discussion. In general, the volume 
density of galaxies evolves by a factor of ~ x2-3 at 10 Mp, , in 
good agreement with the evolution seen in Bundy et al. (2010) or 
lllbert e t al. (2010). This evolution is the same, regardless of the 
axis-ratio of the population. This requires that the growth in the 
mass density of galaxies, in the mass range we consider, must occur 
in a manner that preserves the overall shape distribution. 

(2.4c) and the fact that it occurs in the smallest mass 
bin where the completeness is lowest, we consider this 
difference an interesting but tentative result. 

5. DISCUSSION AND CONCLUSIONS 

5.1. A Universal Ceiling Mass for Flattened Early-type 

Galaxies 

vdW09 observed a ceiling mass of ~ 10 11 M© for 
disk-dominated, quiescent galaxies in th e pres ent-day 
universe. In accordance, IBernardi et al.l (|2010f ) found 
that early-type galaxies with very high masses (~ 2 x 
10 11 Mq) differ in many ways from those with lower 
masses (< 10 11 Mq). In this paper we show that a simi- 
lar transition mass exists at z ~ 0.7 and that its value has 
not shifted by more than 0.05 dex between z ~ 0.7 and 
the present. Thus, at all redshifts, roughly 40% of the 
stellar mass in early-type systems is contained in these 
relatively round systems. As expected from Figure |3J 
round systems (q pro j > 0.6) have a characteristic mass 
of M* ~ 9 x 10 10 Mq while highly flattened {q proj < 0.4), 
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passively evolving galaxies have M* ~4x 10 10 M©. We 
find no significant evolution in the value for M* between 
our two samples, only evolution in the co-moving number 
density. 

This mass ceiling has the same mass, or, in other 
words, M* does not evolve for more elongated or 
"disky" early-type galaxies despite the growth of the 
passively evolving population by a factor of ~ 2 — 
3 i n mass between z ~ 1 and t oday (see Figure 
81 iWolf et ail [20031 : iBell et all l200l iBorch et ail [2001 
Brown et al.l 20071: iFaber et al.l l2007t iCirasuolo et al.l 



2007tlllbert et al. 2010; Brammer_eTalJ[2011|). This has 



two implications. First, the progenitors of today's pop- 
ulation of massive early-type population were not more 
disk-dominated systems at z = 0.7 that faded into the 
passively-evolving population. Instead, these galaxies 
must already be almost round, roughly 2:3 in intrin- 
sic axis-ratio, galaxies before the truncation of star- 
formation (see iKocevski et al.l 120101 for candidate pro- 
genitors). Second, if merging builds up the population 
of galaxies above ~ 10 11 M©, that merging most cause 
them to become rounder systems. 

5.2. Implications for the Formation of Galaxies with 
M > 10 11 M Q 

At the highest masses, we find that, not only is there a 
lack of flattened or "disky" galaxies, but that the dis- 
tribution is consistent with a largely triaxial popula- 
tion. This can be seen by the lack of galaxies that are 
round in projection galaxies at high masses in Figure 
[4[ These apparently round galaxies are seen at lower 
masses, so we do know that the lower fraction of high 
mass, round galaxies not j ust a systematic measurement 
error. iPadilla fe Strauss! ([2008D found a similar result, 
but the lack of evolution we find means that this tri- 
axiality is set in the formation of these systems out to 
z ~ 1. This result, when combined with the observed 
evolution in the normalization of the mass function, and 
hints about the shape (see l4.2.2|) . this triaxial population 
is built up over the redshift range we observe, but is done 
so in such a way to produce similar shaped systems over 
that time. 

Massive ellipticals are assumed to form out of multi- 
ple mergers of near equal mass systems and the merger 
rate is e xpected to be high even at redshifts of z ~ 
0.7 (e.g. iDe Lucia fe Blaizotl [2007H . Detailed simula- 
tions with cosmological initial conditions show that ad- 
ditional mechanisms are required to reproduce the ob- 
serve d shapes and kinematic profiles of massive ellipticals 
(e.g. iBurkert et ail 120081: iNovakl 120081) . Minor mergers 
and tidal encounters also provide a mechanism for mak- 
ing the most massiv e quiescent galaxies appear round. 
IVulcani et al.l ([201 ID finds that the most massive cluster 
galaxies, objects too massive to be in our sample, are less 
round at high redshift. This points to observational ev- 
idence of the process of galaxies becoming rounder with 
time, possibly becau se of the mechanisms suggested in 
IBurkert et al.l (|2008[ ). but only for the rarest and most 
extreme of systems. 

5.3. Evolution of the ~ M* Early-type Population 

At masses < 10 11 M©, the early- type population be- 
comes more and more "disky" . This can be seen in two 



ways, first, we find more round galaxies, q pro j > 0.9. Sec- 
ond, we find more flattened systems, q pro j < 0.4. This 
can be seen in both the minimum axis-ratio we find in 
Figure[4l and, the distribution of c/a values we infer from 
our parametric modeling in Figure [5j 

Quiescent galaxies with masses that dominate the cos- 
mic stellar mass budget (3 x 10 10 M© < M* < 10 11 M©) 
show a broad but non-evolving range in axis-ratios, at 
both z ~ 0.7 and z ~ 0.06. The broad range in axis- 
ratios implies that the population can form through a 
number of channels. Because we find so little evolution 
in the axis-ratios, however, means that, whatever the 
mechanisms that form early-type galaxies in this mass 
range, they must have worked at similar rates across the 
last 7-8 Gyrs of look-back time. This evolution cannot 
explained entirely by the increase in the number of bulge- 
dominated galaxies (say, merger products), nor can it 
be explained entirely by the cessation of star formation 
in disk-dominated galaxies without structural changes. 
Several evolutionary processes that cause the formation 
of quiescent galaxies must contribute in order to explain 
the unchanging fractions of bulge- and disk-dominated 
quiescent galaxies. Moreover, the relative importance 
of the various evolutionary processes has not strongly 
changed over the past 7-8 Gyrs. This is reminiscent of 
the general result that the morphological mix of galax- 
ies of these masses does n ot significantly cha nge over the 
same time (vdW07, H09. IBundv"et^I][20Tol ). 

5.3.1. Growth in the Number Density Growth of Highly 
Flattened Systems 

Our work finds consistent evolution in the number den- 
sity of passively evolving galaxies with redshift. Most 
work finds significant evolution, factors of 2 or 3, in 
the number den sity of galaxies in the mass range of 
our sample(e.g lllbert et all 120101: iBundv et all 120101: 
iBrammer et al.l 1201 1| )~ The combination of a flattened 
population at low masses with the increased number 
density of galaxies with redshift says that, at lower 
masses, the buildup of the mass function of passively 
evolving galaxies, or early-types, is the build up of pas- 
sive disk-like galax ies, such as SOs or "disky" ellipticals 
([Bundv et aL|[2010h . 

How can we explain the existence and continued 
growth of a population of quiescent, flattened galax- 
ies? Gas stripping in group and clus ter environments 
has l ong been argued to play a role ([Spitzer fe Baadd 
I1951D . and was recently shown to ex plain the existence 
of th e morphology-density relation ([van der Wei et ail 
2010). Our tentative detection of an increased fraction 
of "disky", quiescent low-mass galaxies (< 3 x 10 10 M©) 
at late times may indicate that this process is becoming 
increasingly important at late cosmic epochs. 

5.3.2. Structural Properties and Merging 

It is clear that all disk-dominated, quiescent galax- 
ies cannot be the result of gas stripping, e specially 
those outside massive groups and clusters (e.g. iDresslerl 
1980). While this may be feasible in the form of 
efficient gas stripping from satellite galaxies, even i n 
sparser group environments (|van den Bosch et alj [2008) , 
the observed differences between disky quiescent galax- 
ies and star-forming spiral galaxies of the same mass 
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imply that the former are not, generally, stripped ver- 
sions of th e latter. Quiescent galaxies typically hav e 
fewer bars (lAguerri et a l. 2009; Laurikain en et abT[2 009). 
large r bulges (iDresslerl 119801: iChristlein fc ZabludofB 
120041 : iRvdenl 120061: lLaurikainen et all 12010ft and are 
more concentrated (jBundy et al 120 10ft than star-forming 
galaxies, though many of th ese properties are mass de- 
pendent (|Cheng et al.ll20lil ). Finally, the axis-ratio dis- 
tributions of star-forming galaxies are markedly differ- 
ent, much flatter as expected for an oblate spheroid 
population alone, t hen the distributions we observe for 
early-type galaxies (|Rvdenll2006l : lPadilla fc Straussll2008ft 
Thus, at least at higher masses, the truncation of star for- 
mation m ust be intimately linked with bulge growth (e.g. 
iBel 12008). even if a sizable stellar disks remains intact. 

Minor merging may provide a possible path, which 
would provide a natural explanation for our observa- 
tion that the mix of bulge- and disk-dominated quiescent 
galaxies remains unchanged at z < 1. The advantage of 
this mechanism is that minor merging is common, it pro- 
duces most o f the growth for m assive early-type galaxies 
(for example I Oser et "all 1201 lft . Second, the amount of 
minor mergers gro w bulges (e.g. iKauffmann et all 119931 : 
iBaugh et al.lll996ft . though th at depends on the gas con- 
tent of the smaller system (|Mihos fc Hernquisti Il994t 
Hopkins et aT1l2009ft. 

Because of this. iBundv et all (|2010ft suggests a two 
stage scenario. First, some feedback mechanism causes 
star-formation to cease. Second, because lower gas con- 
tent gala xies have more rapid bulge growth from minor 
mergers ([Hopkins et alj 12009ft . the resulting remnants 
are both passively evolving and bulge-dominated. In 
fact, bulge growth through minor merging may cease 
star formation as a result of gas exhaustion, some feed- 
back mechanism (possibly AGN), or the st abilization of 
a gaseous disk against st ar formation (e.g. iBower et al.l 
20061 ICroton"et~ai1 12006ft . The main problem with such 
a process is, however, that we find no evolution in the 
overall axis-ratio distribution. This means this two stage 
process must produce as many bulge-dominated systems 
as new disk-dominated systems are added to the early- 
type galaxy population. If the rate of these two processes 
are not in good agreement, we would see a change in the 
distribution with time, the opposite of what our data 
show. This argues that the bulge growth and disk trun- 
cation should go hand in hand. If merging is the domi- 
nate physical process, this would imply some component 
of the merging 

A dynamical process is required to turn the average 
star-forming galaxy into the typical early-type galaxy, 
for the reasons we list above. This process must gener- 
ate a larger bulge fraction and population with an axis- 
ratio distribution that is markedly different from the flat 
population seen for sta r-forming systems (jRvdenl [20061 : 
iPadilla fc Strauss! 120081 e.g.). The most likely mecha- 
nism is merging, as merging changes the axis-ratio of 
galaxies with low gas masses. Some combination of ma- 
jor merging and minor merging, with more emphasis on 
the later due to its larger frequency, is the most likely 
culprit for structurally transforming active star-forming 
galaxies into the passively evolving galaxies we observe 
both today and at z ~ 1. 

5.4. Future Directions 



Our study uses a simple measurement (the pro- 
jected axis-ratio) to come to arrive at far-reaching 
conclusions about the evolution of galaxy struc- 
ture. The caveat is that we rely on the assumption 
that flattened systems have significant rotational 
support. It also rests on the assumption that one 
number to characterize the intrinsic shape is a sensible 
approximatio n, allowing us to bypass bulge-disk de- 
compositions (iMacArthur et al.l [20081 : lLaurikainen et ail 
\2Xim . ISimard et all 12011ft and spatially resolved, ste l- 
lar dynamics (Ivan der Marel fc van Dok kvml l2007allbl: 
Krainovic et all 120081 : Ivan der Wei fc van der Marel 
2008), which are notoriously difficult at high redshift. 
So far, such studies support our conclusions, most explic- 
itly by the observation that the fraction of rotationally 
supported early-type gala xies is similar at z ~ 1 and in 
the p resent-day universe (jvan der Wei fc van der Marell 
12008ft . 

An interesting question is whether the absence of a 
significant population of very massive disk-like galaxies 
at z < 1 is a fundamental feature of galaxy formation. 
Perhaps under circumstances that are met at much ear- 
lier epochs than z ~ 1 such galaxies can and do exist, 
and the observations presented in this paper merely show 
that merging, either minor or major, is the only relevant 
mechanism to produce very massive galaxies at relatively 
recent epochs. Observations of significantly large sam- 
ples of very massive galaxies at z ~ 2 may provide an - 
swer and do show some hints (jvan der Wei et aLI 120 lift . 
The structural properties of galaxies at the epoch dur- 
ing which the star formation rate was highest will tell us 
whether galaxies with stellar masses M > 2 x 10 11 M Q 
are always bulge dominated. 

5.5. Conclusions 

In this paper we analyze the projected axis-ratio dis- 
tributions of early-type galaxies with stellar masses > 
1.25 x 10 10 M Q at z ~ 0.06 and z ~ 0.7. By model- 
ing the intrinsic distribution, we find that at least since 
z ~ 1, there is a stellar mass ceiling for flattened early- 
type galaxies. Above 10 11 M such galaxies are increas- 
ingly rare, both at the present day and at z ~ 0.7 (see 
Figures HI [7] and [5]). This suggests that at all cosmic 
epochs the dominant evolutionary channel for early-type 
galaxies with higher masses is a dynamical process that 
transforms systems with a 1:3 intrinsic axis-ratio into a 
rounder, triaxial system with a roughly 2:3 axis-ratio. 

Below that mass threshold, the early-type galaxy 
population becomes more and more dominated by 
flattened or disk-like systems, with roughly an axis-ratio 
of 1:3. This is manifest in the number of round galaxies 
as well as in the larger and larger number of galaxies 
that appear thin in projection. This geometric picture 
also fits very well with the kinematic evidence that 
shows that most such early types are 'rapid rotators', 
at least in the present-day universe. Once again, the 
axis-ratio distribution appears to evolve little out to 
z ~ 1. The non-evolving shape of the mass function of 
flat versus round galaxies we find in Figure [8] coupled 
with the overall growth of the normalization implies 
that M* early-type galaxies form in a similar way over 
the last 7 Gyrs. The growth mechanism must roughly 
double to triple the number of early- type galaxies, 
producing a mix bulge-to-disk ratios that varies with 
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galaxy mass, but the process must not vary with time 
in the mass range we study. The leading puzzle for 
early-type formation is a unifying model for how to 
explain this growth in mass density with so little change 
in the shapes of galaxies over the same look-back time. 
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